Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases
نویسندگان
چکیده
In this article, a novel combined i-vector and an Extreme Learning Machine (ELM) is proposed for speaker identification. The ELM chosen because it fast to train has universal approximator property. Four combinations of features based on Mel Frequency Cepstral Coefficient Power Normalized are used. Besides, seven fusion methods exploited. system evaluated with three different databases, namely: the SITW 2006, NIST 2008, TIMIT database. This work employs 2016 database first time identification using integration between approach. From each database, 120 speakers 1200 speech utterances used (overall 360 3600 utterances). Furthermore, comprehensive evaluations exploited wide range realistic background noise types (Stationary AWGN Non-Stationary Noise types) handset effect. compared Gaussian Mixture Model-Universal Background Model (GMM-UBM) other states art approaches. results show that method outperforms GMM-UBM approach state- of-the-art under specific conditions, techniques can be improve robustness effects.
منابع مشابه
The NIST 2014 Speaker Recognition i-Vector Machine Learning Challenge
During late-2013 through mid-2014 NIST coordinated a special machine learning challenge based on the i-vector paradigm widely used by state-of-the-art speaker recognition systems. The i-vector challenge was run entirely online and used as source data fixed-length feature vectors projected into a low-dimensional space (i-vectors) rather than audio recordings. These changes made the challenge mor...
متن کاملThe 2016 NIST Speaker Recognition Evaluation
In 2016, the National Institute of Standards and Technology (NIST) conducted the most recent in an ongoing series of speaker recognition evaluations (SRE) to foster research in robust text-independent speaker recognition, as well as measure performance of current state-of-the-art systems. Compared to previous NIST SREs, SRE16 introduced several new aspects including: an entirely online evaluati...
متن کاملCRSS systems for the NIST i-Vector Machine Learning Challenge
This paper describes the systems developed by the Center for Robust Speech Systems (CRSS), Univ. of Texas Dallas, for the National Institute of Standards and Technology (NIST) iVector challenge. Since the emphasis of this challenge is on utilizing unlabeled development data, our system development focuses on: 1) unsupervised clustering methods to estimate development data labels; 2) build effic...
متن کاملUTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation
This study describes systems submitted by the Center for Robust Speech Systems (CRSS) from the University of Texas at Dallas (UTD) to the 2016 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE). We developed 4 UBM and DNN i-vector based speaker recognition systems with alternate data sets and feature representations. Given that the emphasis of the NIST SR...
متن کاملdesigning and validating a textbook evaluation questionnaire for reading comprehension ii and exploring its relationship with achievement
در هر برنامه آموزشی، مهم ترین فاکتور موثر بر موفقیت دانش آموزان کتاب درسی است (مک دونو و شاو 2003). در حقیقت ، کتاب قلب آموزش زبان انگلیسی است( شلدن 1988). به دلیل اهمیت والای کتاب به عنوان عنصر ضروری کلاس های آموزش زبان ، کتب باید به دقت ارزیابی و انتخاب شده تا از هرگونه تاثیر منفی بر دانش آموزان جلوگیری شود( لیتز). این تحقیق با طراحی پرسش نامه ارزیابی کتاب که فرصت ارزیابی معتبر را به اساتید د...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Circuits Systems and Signal Processing
سال: 2021
ISSN: ['0278-081X', '1531-5878']
DOI: https://doi.org/10.1007/s00034-021-01697-7